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A unified framework is proposed in this paper for parameter estimation 
^ I using convex optimization and experiment design applying convex maximiza- 

tion for Pauli channels, that can be extended to generalized Pauli channels, 
OO ■ too. In the case of known channel directions, an afhne parametrization of the 

^^ i Choi matrix turns the LS parameter estimation into a convex optimization 

C^^ ' problem also for the generalized Pauli channels. A simple iterative algorithm 

^^ ■ for estimating the channel directions is also given for qubit Pauli channels. 

The experiment design was performed by maximizing the trace of the 
Fisher information matrix of the output quantum system to find optimal in- 
put state and measurement POVM for the channel estimation. For the known 
channel direction case it was found that the optimal input state should be pure 
5^ I and the optimal measurement POVM is extremal. It was also shown that both 

the input state and the POVM elements in the optimal configuration should 
be parallel to the channel directions in the qubit Pauli channel case. 

The proposed methods and algorithms are illustrated by simple numerical 
examples. 
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1 Introduction 

Quantum systems are special stochastic nonlinear systems, where the stochas- 
ticity and nonlinearity are caused by the back-action of the measurements on the 
measured system [1], [2]. Therefore, even in the simplest static case, when the pa- 
rameters of a non-dynamic quantum system are to be estimated, one needs special 
estimation methods [3]; this case is called state tomography in theoretical quantum 
physics. 

Quantum channels are widely used information transfer devices in quantum in- 
formation theory [1] , that map an input quantum system into an output one usually 
in a static way. The task of the estimation of quantum channels - commonly known 
as quantum process tomography (QPT) in theoretical quantum physics - got a sig- 
nificant attention over about the last ten years. The problem was investigated by 
several authors [1], |3], |5]. The work [6] gives a comprehensive survey on the dif- 
ferent strategies used for process tomography (or channel estimation). The problem 
can essentially be formulated in two type of methods: direct, and indirect. In the 
indirect method, we trace the problem back to quantum state tomography, i.e. the 
information about the unknown quantum channel is obtained by sending known 
probe quantum systems through the channel, and performing state tomography on 
the output states. In contrast, in the direct method, the experiments directly give 
information about the channel, without the need for a state tomography step. 

From a methodological point of view, there are two principally different ap- 
proaches to the problem of quantum tomography, the statistical approach and the 
convex optimization based approach |1]. The former gives information on the statis- 
tics of the estimate and on its covariance matrix, but it has the drawback, that it 
is hard to compute in higher dimensions. In spite of this, majority of the existing 
methods belong to this category. 

In contrast to this, an optimization based method does not give as much infor- 
mation, but it is relatively easy to compute. This approach has been pursued in the 
work [7] where the problem of channel estimation (in the form of the Choi matrix, 
which is a Hermitian matrix representing the channel) is considered, assuming a 
completely general channel, thus without any assumption on the inner structure of 
the Choi matrix. The author uses random input states, and random measurements 
on the output, and formulates a maximum likelihood problem. A similar method 
is used in the work j8], which formulates the task of process tomography as a least 
squares problem, which is convex. It also uses the Choi matrix as optimization vari- 
able, thus searches the optimal channel in the convex set of all completely positive 
and trace preserving (CPTP) maps using multiple input-measurement pairs. 

However, as it is stated in [H], it is a reasonable assumption to consider only a 
certain family of channels given with a model, based on a priori knowledge about the 
structure of the channel. One of the aims of our work is to develop a method which 
is capable of incorporating these constraints into the channel tomography problem, 
while still remaining - at least partially in the general case - solvable by convex 



1. INTRODUCTION 



optimization. 

It is a coninionly known fact that system identification is intimately related to 
experiment design, the general aim of which is determining experimental conditions 
that result in good or even optimal identification results |T0]. Thus the method of 
identification, that is, model parameter and structure estimation, determines the 
methods applied for experiment design, too. In addition, the nature and properties 
of the system to be identified have also a determining influence on identification and 
experiment design. The experiment design for quantum channel parameter estima- 
tion includes the design of the quantum input to the channel, and the observables 
to be applied on the resulting quantum output system, that is called the experiment 
configuration, together with the number of measurements to be performed in the 
different experiment configurations if one has a few of them. 

The results on experiment design for quantum state and channel estimation ap- 
pear sparsely in the quantum state and process tomography literature, when the 
authors investigate the optimality of their experiment configurations. The problem 
of optimal experiment design for quantum state tomography was first investigated by 
Kosut et al. [B] who developed methods using convex optimization for the determi- 
nation of the number of measurements to be performed in the different experiment 
configurations. Since then, a few more papers can be found about optimal experi- 
ment design for quantum state estimation (see e.g. [11] for a recent paper), but the 
problem is far from being solved for all cases. 

The problem of finding an optimal estimation of one parameter quantum chan- 
nels is discussed in [12], for both the single qubit input and the entangled qubit input 
case using statistical methods. An efficient estimation scheme is proposed in |13] . 
where the quantum Fisher information and information geometrical considerations 
lead to an optimal measurement configuration for generalized Pauli channels that 
act on more than 2-dimensional quantum systems with entangled finite dimensional 
quantum inputs. Although the restricted experiment design problem, i.e. the deter- 
mination of the number of measurements to be performed in the different experiment 
configurations for quantum channels has also been formulated in the pioneering work 
of [8], it is much less investigated than its state tomography counterpart. A recent 
paper of [14] gives a good overview of the state-of-the-art in this field. 

Motivated by the above experiment design problems for quantum process to- 
mography and by our recent work of optimization based quantum channel estima- 
tion [15], the aim of this study is to propose a unified framework for parameter 
estimation using convex optimization and experiment design applying convex max- 
imization for Pauli channels, that can be extended to generalized Pauli channels, 
too. 

The rest of this paper is organized as follows. First the basic notions on quantum 
systems, quantum channels and quantum parameter estimation are described for 
the finite dimensional case in the next section. Then the convex optimization-based 
channel parameter estimation method is given in section [3l This is followed by a 
section about estimating the channel directions. Section [5] is devoted to the proposed 
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experiment design method that is based on convex maximization. A separate section 
thereafter illustrates the advantages of the proposed experiment design method using 
case studies. Finally conclusions are drawn. 

2 Basic notions 

Some basic notions of quantum systems are described in this section for the finite 
dimensional case. 

2.1 Quantum Measurements and Fisher Information 

2.1.1 State Representation of Finite Dimensional Quantum Systems 

The state of a finite dimensional quantum system is described by a so called 
density operator or density matrix p that acts on the underlying finite dimensional 
complex Hilbert space Ti. Density matrices are self-adjoint positive semidefinite 
matrices with unit trace, i.e. 

p>0, p* = p, Tr(p) = l. (1) 

where p* denotes the adjoint of p. 

Two-level quantum systems are called quantum bits, their density matrices are 
2x2 complex matrices that are of the form 



1 
P 



1 + ^3 Ui-tU2 ^^l j^s^ 



1 + ^3 Oi- 102 
9i + 102 1 - 03 



M/ + ^^,a,] , (2) 



j=i 



where (Ti, a"2, (73 are the so-called Pauli matrices, and I is the unit matrix. The vector 
= {01, 02, ^3)^ is in the 3-dimensional unit ball of M.^. This state representation is 
called Block vector. 

Let \4>i^i) and |0j,2) be the normalized eigenvectors of ai {i = 1,2,3). Then 

ai = |0i,i)(0i,i| - |0i,2)(0i,2| (i = 1,2,3). 

It is well-known that 

\{Mhi)\' = l i^y'j) (3) 

which means that the three bases {{(pi^i), |0j,2)} (^ = 1,2,3) are mutually unbiased, 
and form a so called MUB (mutually unbiased bases). 
In this formalism we have 
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Y.^i\(t>i^){4>^A ■ (4) 



Note that we can uniquely represent any density matrix with its Bloch vector 9 
given the MUB. 

2.1.2 Quantum Measurements 

Quantum measurements are described mathematically as 
M = {Ml, ..., Mm}i where the self-adjoint positive operators Mi act on the Hilbert 
space and Mi + ... + Mm = I- Such M = {Mi, ..., M^} is called to be a positive 
operator-valued measure (POVM). If the positive operators Mj are all projections, 
then we get a so called projective measurement. 

If a POVM M is performed as a measurement on the state p, then the possible 
outcomes are 1,2, ... ,m and the probability of the outcome i is Tr(pM(). Let pe 
be a parametrized quantum state and a POVM M = {Ma : a E A} is used for 
the measurement, where A denotes the set of measurement outcomes. Thus the 
probability distribution of the outcomes is 

pia\e) = TiipeMa) (a e A) , (5) 

which is a set of probability distributions parametrized by 9. 

2.1.3 Fisher Information 

The Fisher information reflects the amount of information that a measured ran- 
dom variable can carry about the parameter 9 = {9i, . . . ,9kY' . In other words, it 
measures the accuracy of the unbiased estimator 9 of 9. Fisher information is a 
classical concept in statistics. 

The accuracy of the estimator is expressed by the covariance matrix: 

Var(^),,,- = E{9i9^) - E{9,)E{9j) , 

where E is the expectation value. 

The Fisher information matrix for the quantum setting is by ^E\ 

and the Cramer-Rao matrix inequality describes their relation: 

Var(^) > F{9Y^ 

This bound shows that the higher the Fisher information, the better estimation we 
can have. The formula for F also shows that in the quantum case, it depends on the 
actual measurement POVM M with which the experiments had been performed, 
i.e. F{9,yi). 
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2.2 Quantum Channels 

Quantum channels model the information transfer between quantum systems, i.e. 
they transform the source quantum system into a target one. A quantum channel 
£ : B{'Hi) — )■ B{'H2) is defined to be a completely positive and trace preserving 
(CPTP) map, where B{'Hi) is the operator algebra on the Hilbert space Hi. This 
map can be represented by a set of operators, Vi : "Hi — )> 7/2, called the Kraus 
representation, which gives the channel output as 

£{p) = Y.v,pv: , (7) 

i 

and the operators must satisfy the relation 

Y.v:v, = i, (8) 

i 

in order to represent a trace preserving map. 

We have to mention that the set of operators in the above representation is 
not unique. This drawback can be eliminated with another possible description of 
channels using the definition of the Choi matrix. Let "Hi and 7/2 be Hilbert spaces, 
and E : B{'Hi) — ?> B{'H2) be a linear mapping that represents the quantum channel. 
To define the Choi matrix of £ we take an orthonormal basis /i, ..., /„ in T-ii. Then 
\fi){fj\ G B{'Hi) and £ acts on this operator. Then the Choi matrix according to |2] 
is 

Xe = $^l/.)(/.l ®^(l/.)(/,l) e B{n,)®B{H,) . (9) 

where denotes the tensor product. Actually, the above matrix is a block matrix, 
its ijth element is £{\fi){fj\)- The complete positivity of £ is equivalent with the 
positivity of X^. Furthermore, £ is trace preserving, if 

Tr[£{m{f,\)]=TT{m{f,\)=6,, 

which means Tr2(X£-) = /. 

2.3 Pauli Channels 

A notable wide class of quantum channels are the Pauli channels. 

2.3.1 Qubit Pauli Channels 

In the qubit case when the input density matrix p has the form ([2]), 

3 



^(P) - \ 



1 + A3^3 Ai^i - 1X262 

X\9i + iA2^2 1 ~ ^36*3 
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is a simple example of a qubit Pauli channel. The Choi matrix of this channel is the 
following: 

I + A3 A1 + A2 



X, 



1 I-A3 A1-A2 

Ai - A2 1 - A3 

A1 + A2 I + A3 

The conditions of positivity for this matrix in terms of the parameters are 

|1±A3| > |Ai±A2| , 

and the trace preserving requires 

\\i\ < 1 , 2=1,2,3. 

The typical Pauli channel acts on a density matrix 



'11^ 



:i2i 



p 



\ 4=1 / i=l 



as 



S{p) = ^ ^l-Y^\^e^I + Y^\A\(|>i^){(t>^^\■ (13) 

To describe this channel we need the three real constants Ai, A2, A3 (satisfying ( ITTl) 
and ( IT2|) ) and the vectors |0i,i), |02,i), 103, 1) (satisfying ([3])). So a Pauli channel is 
given by 6 data items. Below the vectors |0i,i), |02,i), |03,i) will be called channel 
directions. The 

Ai = {a|0i,i)(0*,il + ^l'/'i,2)(04,2| : a,6 G C} {i = 1,2,3) 

are commutative subalgebras. The effect of the channel can then be described as 
depolarizing in each subalgebra Ai with the corresponding parameter Aj. 



2.3.2 Generalized Pauli Channels 

The generalized Pauli channel is discussed in [17]. Assume that the operator 
algebra B{'H) contains subalgebras ^1, ^2, • • • , Au which are complementary in the 
sense that 

Tr {AiA,) = if Tr (A) = Tr {A,) = 0, A, G A, A, G A,, i ^ j. 

The trace preserving projection Ei : BlTi) — )■ Ai is usually called conditional expec- 
tation. If d is the dimension of Ti, then for any input A G B{'H) in the form 



(M-l)Tr(A) v^^,,. 
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S{A)=(l-±x)l^I + ±XMA) 



(14) 



is a generalization of the Pauli channel. In the classical Pauli setting the 

A = {a|0.,i)(0i,i|+6|0.,2)(0i,2| :a,bGC} (z = 1,2,3) 
are commutative subalgebras and, for example 

" 1 e^ 



El 



1 + ^3 01- 102 
01 +102 1- 03 



1 



Similarly to this example here we consider the case when all of the complementary 
subalgebras are maximal Abelian. 

In this case, we can take u orthonormal bases in the d dimensional Hilbert space 

n-. 

{\(l)Lk) ■■ I < k < d} l<i<u. 
Similarly to the Pauli case, it is assumed that 

o 1 



\k\(Pj,0 



d 



(^^J), 



(15) 



i.e. the bases are mutually unbiased. Then the relevant subalgebras are 

Ai= I ^Cfc|0i,fc)(0i,fc| : Cfc G C > (1 < 2 < m) , 

and the conditional expectations are 

d 

Ei{A) = ^(0j,fc,A(^i,fc)|0j,fc)((^. 



i,k\ 



fc=l 



So the generalized Pauli channel is defined by the formula (IMj) . The conditions for 
complete positivity and trace preserving property are 

1 



l + dX,>Y^ Xj > 



d-1 



\XA < 1. 



(16) 



3 Quantum Channel Parameter Estimation as an 
Optimization Problem 

The parameter estimation of quantum channels, or quantum process tomography 
is a widely investigated problem in mathematical physics. The pioneering works of 
Kosut and co-workers |8] formulated its variants as a convex optimization problem. 

The formal mathematical description of the general quantum process tomogra- 
phy problem is as follows. Consider an unknown quantum channel S : i3('H) — !■ 3(71), 
which is to be estimated. We use a so called tomography configuration for this pur- 
pose that contains the following elements: 
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e[p^) = a^ 



POVM 



M. 



{Ma,-y} 



Outcome counts 



{c, 



a, 7) '''7 



trials } 



Figure 1: The scheme of data collection for process toraography. 

• A known input density operator p on the Hilbert space "H of the system. 

• A POVM with which we can perform quantum measurement on the channel 
output state £{p). 

From the above, we compute an estimate of the channel parameters using the mea- 
sured data and the parametrized model of the channel. 

Note that we can use multiple different tomography configurations, i.e. different 
input states and POVMs in order to achieve better estimation on S. In this work, the 
(input, POVM) pair corresponding to the 7th configuration is denoted by (p^, M^). 

3.1 The Estimation Method 

3.1.1 Experimental Data Collection 

The first stage of process tomography is the collection of the measurement data. 
The measurements are performed in each 7 configuration n^ times independently. 
During data collection, the different a outcomes of the measurements in the config- 
uration 7 are counted in the variable Ca^-y, and put in the measurement record R. 
This scheme can be seen in Fig. [1] Then obviously Yla'^o',! ~ '"•7- Thus, we have to 
perform a total number of ntot = Yly "^i independent measurements. The estimator 
£ of the channel £ will be calculated from these measurement outcomes. 



3.1.2 Least Squares Estimation 

If £ were given in its Kraus representation, then the resulting LS optimization 
problem would be nonconvex, because the optimization variables would be the Kraus 
operator elements V^. To overcome this difficulty, it is reasonable to choose the Choi 
matrix as optimization variable. 

By the use of relations (jS]) and (I9]) we get [I5] 



V> 



0,7 



Tl{Ca,'yX£) 



where X^ is the Choi matrix of the channel £", and the configuration matrix Ca,-^ = 
p^ ® M* depends on the channel input p and on the measured POVM elements in 
configuration 7. 
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The probability Pa,'y can be estimated as 



P.,7 = — (17) 



using the relative frequency that can be calculated from the measurement results. 
The variance of this unbiased estimate after n^ independent measurements is known 
to be 

VaT(pa,y) = Pa.-yfl -Paa) , (18) 

because Pq, ,y has a binomial distribution. These show that for large ra^, jOq,^^ — )■ pa^^ 
and Var^po,^^) tends to as ri^ — )■ oo, so j3q, ,y is a reasonable unbiased estimate of the 
real value Pq,,^. This leads to formulating the parameter estimation as the following 
least squares problem: 

p„,7 -Tr(C„,^Xf) , (19) 

a, 7 

SO that Xe > 0, Tr2(Xf) = / 

This problem is a convex optimization problem in the Choi matrix X^, thus it 
can be solved relatively easily using existing numerical algorithms [T8 | [T9 ] . 



3.2 Estimation of Pauli Channel Model Families with Known 
Channel Directions 

In practice it is reasonable to assume that we know a model type of the channel, 
and only the unknown values of the parameters of this model have to be estimated. 
In such a problem, the above derived least squares objective cannot be used directly, 
as it assumes a completely general channel model, and estimates the elements of the 
Choi matrix. Thus, if the task is to estimate some specific model parameters, this 
method can suffer significantly from overparametrization. 

Some authors proposed approaches based on prior information on the channel, 
thus obtaining a well conditioned parameter estimation problem. These are mainly 
derived from physical interactions involved in the dynamics [H], and in |20] the 
authors also consider the problem of finding the optimal series of experiments to 
estimate the channel parameters. 

As another possible solution, we can study the internal structure of the Choi 
matrix, and use this information to select more appropriate, model specific param- 
eters for optimization. Effectively, this should reduce the set of optimal solutions of 
problem (fT9|) to solutions, which are consistent with the desired model family. 

3.2.1 AfRne Approximation 

The natural choice would be to select just the unknown channel parameters. 
However it can be easily seen, that this choice would ruin convexity, as the Choi 
matrix can be an arbitrarily nonconvex function of these in the most general case. 

10 
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Thus, instead of this, the following method is used. Let hi{X) , . . . , hm{X) de- 



note functions of the channel parameters, and let Hq, Hi 



Hm denote constant 



Hermitian matrices. Then we can expand the Choi matrix as an affine function |15] : 



X£ = J2Hkhk{X) + Ho 



(20) 



This way we can approximate the Choi matrix by an affine structure, and use the 
functions hk{X) as optimization variables. 

Note that the trace preserving constraint can be omitted in (IT9|) . as it can always 
be taken into account in the model construction. 

3.2.2 A Pauli Channel Parameter Estimation Example 

Assume that the channel directions are known or have been determined (see 
section H] later). Using f l20l) . the Choi matrix of the qubit Pauli channel can be 
decomposed using the following Hermitian matrices: 



^0 = 77 



H 



2 — 7: 
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H. 
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1 
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1 













1 





-1 










2 










-1 


















1 



and the new optimization variables will be 

/ii = Ai, /12 = A2, h^ 



A, 



Thus, in this representation, the parameter estimation of any two dimensional Pauli 
channel is a convex problem, as the optimization variables are exactly the channel 
parameters to be estimated. 



3.2.3 The Case of Generalized Pauli Channels 

The above example can be generalized to the higher level Pauli channel case 
assuming again known channel directions. For the sake of simplicity we choose the 
simplest 3-level (qutrit) case, when d = 3 in sub-section 12.3.21 The used MUB will 
be the bases suggested in |2T]. Then the Choi matrix of the channel will be the 
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following: 



Xe = ^ 



h 





/3 





h 





/2 





n 

h 









h fX 



h 





ft 





/2 





/4 





/3 





h 





h 





h fX 








/2 

n 









h 

/2 









h 

n 





/2 





/3 





/a 




/l 



where 



/i = 1 + 2A2, /2 = 1 - As, /s = Ai + A3 + A4, 
/4 = Ai-^(l + iv^)-^(l-iv^), 

Note that the d = 2 case using the same MUB selection method is exactly the same 
as the qubit Pauli channel in subsection 12. 3.11 

The decomposition ( pOj) of the Choi matrix can be calculated easily given that 
the optimization variables are again exactly the channel parameters: 



hi — Aj, 



A 



This, and the construction of the channel shows that the problem of parameter 
estimation is convex, and solvable using only (IT^ in any such - arbitrarily high - 
dimension, in which the channel itself can be defined. 



4 Estimating the Channel Directions 

During the parameter estimation of Pauli channels it is generally assumed that 
the Pauli channel directions are known. This, however, is not true in general so this 
section describes a method to estimate these directions, while resulting in a first 
estimate on the parameters, too [15j . 



4.1 Qubit Case 

If we do not know the exact three directions |(/>i,i), |(^2,i), |03,i) in which the 
Pauli channel is depolarizing, then quantum state estimation steps can be used to 
determine this structure. 

Let us fix three vectors |v5i,i), 1 922,1), 1 923,1) satisfying ([3]). Then the operators 
\(pi,i) {(pi,i\ , i = 1, 2, 3 formed by the channel directions can also be expressed in the 
form of dl]) on the MUB determined by |v?i,i), 1 922,1), 1953,1) with Bloch vectors Vi, 
V2, and V3. These vectors form a basis in M^. Let us further assume that the input 
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qubit to the Pauli channel is represented by the Bloch vector b in the {vi} basis 
representing the channel directions. 

Then, the effect of the channel for the input Bloch vector b (||b|| < 1) can be 
written as 

3 3 

b = ^ biYi -> ^ Xikvi . 

i=l i=l 

In the rest of this section, the words "vector" and "state" are used as synonyms, 
both referring to Bloch vectors. 

4.1.1 The Case of Different Channel Parameter Values 

Assume that all of the Aj channel parameters have different absolute values, 
therefore |Aj| < 1 , z = 1,2,3. Prepare a pure state b with ||b || = 1? and use it 
as input to the channel. Then the output b*^^^ can be expanded in the {vj} basis, and 

we get that each component of b got scaled by the corresponding Aj parameter. 
As |Aj| < 1 from the positivity and trace preserving constraints fITT]) . the absolute 
value of each nonzero component will get smaller. Let the channel parameter with 
the largest absolute value be A^. Then the absolute value of component bm will 

decrease relatively the least among nonzero components of b .In other words, the 

value '(„) will get bigger for i = m, and smaller for i ^ m. 

If we continued this procedure, and put the channel output b'"^ back into the 
channel as input to get the output h^'^'^^' , then by the above argument, the value 

'(„) will converge to 1 for i = m, and to for z 7^ m. Thus the sequence 



b(") 
|b(")| 



n=0 



will be a Cauchy sequence, and will converge to the direction v^ that corresponds 
to the parameter A^ with the largest absolute value. The normalization in the above 
sequence is inevitable, as the output states do not remain pure during the iterated 
channel effect, i.e. the length lib'-"' 11 of the sequence will not remain 1, it will converge 
to zero instead. 

Thus, to prevent the vector sequence from converging to the maximally entangled 
state, we have to do the normalization of the output Bloch vector b*-"' manually 
after each step. This means that we have to exchange the output state with the 
pure state which points in the same direction. Thus we need to perform quantum 

state tomography. After the normalization of b^"% we get the pure state b , which 
can be put again in the channel. This way, the Cauchy sequence of vectors converge 
to Vm. 

The accuracy of this procedure has of course a limit, set by the accuracy of 
quantum state tomography. Convergence to a channel direction is guaranteed only 
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until the difference in the input and output state is not comparable with the un- 
certainty of the state estimation procedure. Thus, when the sequence reaches this 
limit, the searching procedure should stop. It can also occur that we give a good 
initial guess, and start with an input state which is close to a channel direction. 
Then that direction can be accepted, as slow convergence can only occur close to 
channel directions. 

After the first channel direction v^ was found using this procedure, we can 
continue the search in the plane orthogonal to v^- However, due to the inaccuracies 
in state tomography, the direction we will find will not be exactly v^, rather some 
vector b ^ v^- Thus, it is more robust if we apply a projection to the output vector, 
onto the subspace in which we want to do the searching. When the second direction 
is found, then the third can be easily obtained, as it will be the one orthogonal to 
both the first and the second direction. Thus the direction estimation procedure is 
finished. 

4.1.2 The Case of Equal Channel Parameter Values 

In the degenerate cases when some of the channel parameters Aj have equal 
absolute values, then the channel is equally depolarizing in the linear span of those 
directions, i.e. there are no exact channel directions defined in that subspace. This 
means that we can use any state inside this subspace as channel direction, so the 

sequence {b } _„ of states is only required to converge to an arbitrary state inside 
this subspace, which is guaranteed by the above procedure. 

It follows, that if all the channel parameters have equal absolute values, then the 
channel is the depolarizing channel, which means that any three orthogonal Bloch 
vectors can be used as channel directions. 

4.1.3 Algorithm for Direction Estimation 

The procedure described in the previous subsections can be summarized in the 
following algorithm. We would like to estimate the three depolarizing directions of a 
qubit Pauli channel S. Let the set of found channel directions be D. We start with 
D = {} and n = 0. 

1. Prepare a pure state b G D . 

2. Put b into the composite channel S'^ formed by cascading k instances of the 



channel £, then get the output b^"""*" ' . 

3. Perform quantum state tomography on b^"^ '. 

4. Normalize b'"^^' to get the pure state b 

5. Project b to the subspace D . 
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6. If the distance lib — b 11 is smaller than the some value determined by 
the variance of the used quantum state estimation method, then continue with 
step ([7]). Else increase n by 1 and continue with step ([2]). 

7. Put b into the vector set D, set n to then restart with step (JT]). 

This algorithm - though rather resource intensive - thus estimates the directions 
of a Pauli channel. During the algorithm, we can get information also on the channel 
parameters, which can be made more accurate using the optimal tomography con- 
figurations described in Section [5l thus making a two step Pauli channel estimation 
procedure. 

4.2 A Simple Numerical Example 

In order to illustrate the operation and properties of the above proposed channel 
direction estimation algorithm, a simple illustrative numerical example is presented 
here for a qubit channel with different parameters Ai = 0.6, A2 = 0.3 and A3 = 0.1. 

The three unknown channel directions were chosen to be the eigenvectors of the 
Pauli matrices. The uncertainty in the estimated channel output state arising from 
quantum state tomography was simulated using random perturbations in the output 
state. The perturbation for the zth direction is a random term added to the Bloch 
vector component 9i, and is of the form 



,,'1-"' 



N ' 

where ^ is a random number taken form the standard normal distribution, A^ is the 
number of measurements in the state tomography step, and ^^ is the variance of 
the estimator 9i. 

The result of the numerical test can be seen on Figure [2l The three unknown 
channel directions are shown by the black axes in the Bloch sphere. The colored vec- 
tors indicate the perturbed and normalized input states in each step, and the white 
vectors indicate channel outputs. The starting input vector were chosen randomly 
at the beginning of the search for each direction. The perturbation in the output 
states assumed A^ = 5000 measurements in the state tomography steps. The red 
vectors correspond to the direction with parameter Ai of the largest absolute value, 
the green vectors correspond to the direction with parameter A2, and the blue vector 
correspond to the remaining direction with parameter A3. Five iteration steps were 
performed in each of the two estimated directions. 

It can be seen from the figure, that the sequence of input states converge to the 
channel direction of highest absolute parameter value in the subspace D in five 
iteration steps. 
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'T-"' * 




Figure 2: Example on channel direction estimation for the qubit channel with parameters Ai = 0.6, 
Aa = 0.3 and A3 = 0.1. 



5 Experiment Design in the Known Channel Direc- 
tion Case 

The field of experiment design for quantum channel parameter estimation has 
not matured yet, even the problem statements have not cleared up. Only a few 
papers exist that aim at determining the elements of the tomography configuration, 
i.e. the input quantum system and the measurement POVMs, (see e.g. |12] . |13]). 
These papers, however, fix one of the elements - the input quantum system, for 
example - and determine the other (say the POVM) according to some optimality 
criteria. The only paper that uses an optimization approach to experiment design 
solves a restricted experiment design problem, i.e. the determination of the number 
of measurements to be performed in the different experiment configurations j^. 

In this section an experiment design problem of the whole tomography configu- 
ration is formulated and solved in the form of a convex maximization problem. 



5.1 Problem Statement 

Suppose we have a quantum channel £x with some fixed channel parameter vector 
A. We would like to find the input state p and a measurement POVM M for which 
the Fisher information F(A) of the channel parameters estimated from the channel 
output £\{p) using the POVM M is maximal. As the Fisher information will be a 
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matrix, by maximization of F{X) we mean the maximization of an appropriately 
selected scalar function of F{X). 

As we have seen before, the probability p{a\X) of the measurement outcome a of 
the state Sx{p) can be rewritten as TT{pxMa) = TT{CaXx), where Cq = [p^ ® M^) 
is the configuration matrix. Using this, the Fisher information will be 

i^(^)i..- = E wk) ^^' '^^'-'"'^ ^^^ '''"-^^' ■ 

To be able to do the maximization, a scalar valued objective function is needed. 
By the property Tt{A) < Tt{B) whenever A < B for the Hermitian matrices A and 
B, we take the trace of the Fisher information matrix: 

^W = E™7k>(7^Tr(C„A-,))\ (21) 



_^ Tr(C„A'j) \d\ 

It can be shown that the function F is convex in the configuration matrix Ca 
on the set of valid Ca matrices, thus convex both in the input p and the used 
measurement POVM M if we fix the other to be a constant. 

From this it follows that F should take its maximum at an extremal point of the 
feasible region containing the possible experiment configurations. Thus, the optimal 
input state will be pure, and the optimal measurement POVM will be a so called 
extremal POVM 1221. 



5.2 The Optimal Configuration for Qubit Pauli Channels 

Now we study the case of qubit Pauli channels, and will assume that the three 
depolarizing directions of the Pauli channel are known. Because of the rotational 
symmetry of the Bloch ball, the obtained results can be applied to any other Pauli 
channel, with different directions. 

The experiment design problem is solved for projective measurements, which 
can be represented with two-element extremal POVMs {\i/j) {ip\ , I — |'?/')('0|} |22) . 
Let these POVM elements be represented with the Bloch vectors m and — m with 
||m||2 = 1. Let also the pure input state be in Bloch parametrization ([2]), with the 
Bloch vector denoted as b. 

Then the channel output with channel parameter vector A = [Ai, A2, Aa]^ will be 
(fTOJl . and if we write the trace of the Fisher information matrix of the the channel 
parameters, we get 

1 - {niibiXi + m2b2X2 + m^b^Xsy 

Recall that the unit length requirement on the vectors b and m follows from the 
convexity of F, which we want to maximize. Note also that the above formula is a 
special case of Eq. (12T]) . 
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Let us now define the vector c = [mibi, rn.262, "msbsf^ , which can be tought of as 
the configuration vector of the channel estimation problem, which includes not only 
the input state and measurement information, but in this case also the assumptions 
on the channel structure. The objective (l22l) will then be 



F(A) 



T T 

c c c c 



^A)2 1 - cTAA^c ■ 



By Holders inequality it is easy to see that the set of all possible c vectors form an 
octahedron inside the Bloch sphere, whose vertices are the unit vectors pointing to 
the three directions of the channel. Thus the set of all c vectors is convex, moreover 
we have equality if and only if |6jp = |mjp, i.e. when the vectors b and m are 
parallel. 

As the objective is convex in both b and m and thus in c we know that it takes 
its maximum at a vertex of the octahedral feasible set. Thus the optimal c has not 
only unit 1-norm, but unit 2-norm, too. This can only happen if only one component 
of c is nonzero, which means that both the input and the measurement have to be 
in the same channel direction. This implies that the objective is maximized clearly if 
the direction of c is that direction, for which |Aj| is maximal. Let this be for example 
Ai, then the optimal objective will be 

F(A) = 



Now, we see that performing experiments in this direction does not give any 
information on the other directions, so we have to search for additional experimental 
configurations. Let the direction of the optimal configuration found first be the 
direction x, i.e. for i = 1. If we now constrain the objective ( 122|) to the plane 
orthogonal to x, then we get the constraints rui = and 61 = 0. Using the same 
derivation as in the general three dimensional case, we get that the next optimal 
configuration will be the y (with i = 2) or z (with i = 3) direction, and so on. 

As a conclusion, the three Pauli channel directions can be used as optimal direc- 
tions for both measurements and input states. 

5.2.1 Optimal Parameter Estimation of Qubit Pauli Channels 



The general least squares objective function in Eq. flT9l) used for process tomog- 
raphy can be simplified using the optimal experiment configuration. 

For the case of qubit Pauli channels, we can express the outcome probabilities 
in the configuration 7 as 

lic^A 
P±n = 7^ • 
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Substituting this into the objective function (fT9l) . we get 

>^/ l±cTA\' , , 

arg mm ^ ( p±,^ ^ 1 , (23) 

SO that |1 ± A3I > |Ai ± A2I . 

If we assume that the channel is truly Pauli, then we do not need the constraints on 
the parameters, because the global minimum of the objective function will be inside 
the feasible region. Applying the optimal tomography configuration, we get 

arg mm ^ ( j5±,^ ^ j , (24) 

which can be written as 

1 3 

-A^A + (p„ - p+)^A + p^p+ + p^p_ - -. 

Here the p_j_ vectors contain the measured +1 and —1 outcome probabilities (i.e. 
the relative frequencies) for each configuration. Setting the gradient equal to zero, 
we get the optimal estimator for the channel parameters: 

^ = P+ - P- • 

Thus if the optimal configurations are used, then the qubit Pauli channels can be 
estimated in a very simple and efficient way. 

5.2.2 Optimal Configuration for Generalized Pauli Channels 

Unfortunately, the above derivation does not generalize to the higher dimensional 
case in a straightforward way. Therefore, numerical optimization can be used to find 
optimal measurement configurations. A case study can be seen for the qutrit case 
in subsection 16.3.11 The example suggests that in higher dimensions, when the com- 
plementary subalgebras Ai are maximal Abelian, the optimal set of configurations 
will be similar to that of two dimensional Pauli channels. 

Namely, we will need d + 1 configurations, with a pure state p G Ai as input, 
and the POVM \Ma,i\ as measurement, where M^^i G Ai is also pure in the zth 
configuration. 

6 Case studies 

The aim of the simulation experiments was to analyze the effect of experiment de- 
sign on the performance of the numerical optimization based estimation of quantum 
channels. Results were generated in MATLAB environment, using simulated ran- 
dom measurement data. The optimization problem (IT9|) was solved using YALMIP 
modeling language |23] and the SDPT3 solver 
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6.1 Tomography Configurations 

The experiments in the nonoptimal test cases were set up as follows. 

• The used input states were all pure states. 

• To obtain a toniographically complete measurement, appropriate POVMs were 
selected on the Hilbert space of the system, and were used for measurement. 
In two dimensions, these were the Pauli matrices, and observables with simi- 
lar properties in higher dimensions. Each of these can be decomposed into a 
POVM, which can be used in one configuration. 

• The total ritot number of measurements was distributed among all the config- 
urations equally, i.e. for each configuration 7, an equal number of experiments 
were used. 

Each experiment setup was repeated five times and their average was taken. Each 
of the estimated process Choi matrices Xg and channel parameters were analyzed 
using the following estimation performance measuring quantities: 

• The empirical mean A of the estimated parameters A. 

• The empirical covariance matrix of the estimated parameters A, where only the 
diagonal elements of the covariance matrix, i.e. the variances of the parameters 
were computed as 

1 ^ 
Var(A.) = -^(A,,-A.)^ 

• Hilbert-Schmidt norm of the estimation error: \\Xs — X£\\. 

6.2 Qubit Pauli Channels 

In this section we show some examples on process tomography which demon- 
strates the differences between nonoptimal and optimal experiment configurations. 
The channel parameters in each test were Ai = 0.3, A2 = —0.1, A3 = 0.1. 

6.2.1 Pauli Channel Estimation with Nonoptimal Configuration 

First, we perform an experiment with the minimal POVM described by |25] . 
which is toniographically complete. In this case only one configuration is used with 
a total number of measurements ntot = n^ = 4500, and with the pure input state 

The characteristic quantities are plotted in Figure [3l 
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Figure 3: Estimation with nonoptimal configuration for the channel parameters Ai = 0.3, A2 
-0.1, A3 =0.1. 



6.2.2 Pauli Channel Estimation with Nonoptimal Input State 

In this case, the measurements were performed in the three optimal measurement 
directions using three experiment configurations, with n^ = 1500 measurements in 
each. The input state, however, was a nonoptimal pure input state 

in all three configurations. The characteristic quantities are plotted in Figure HI 

6.2.3 Pauli Channel Estimation with Optimal Experiment Configura- 
tion 

In this setup, both the input and measurement were optimal with respect to the 
known channel directions. The characteristic quantities are plotted in Figure [5l 

The results indicate, that the efficiency of the optimal experiment configuration 
highly outperforms the nonoptimal ones. We can also see that in the optimal setting, 
we can reach a very accurate estimation with only about n^ = 1000 number of 
measurements in each configuration. 
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Figure 4: Estimation with nonoptimal input state for the channel parameters Ai = 0.3, A2 ~ —0.1, 
A3 = 0.1. 
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Figure 5: Estimation with optimal configuration for the channel parameters Ai = 0.3, A2 
A3 = 0.1. 



= -0.1, 
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6.3 Generalized Pauli Channel in 3 Dimension 

In this section we show some examples on process tomography which demon- 
strates the differences between nonoptimal and optimal experiment configurations 
for the case of the qutrit generalized Pauli channel (i.e. with d = 3), based on nu- 
merical studies. The channel parameters in each test were Ai = —0.3, A2 = —0.2, 
A3 = -0.1 and A4 = 0.1. 

6.3.1 Nonoptimal Experiment Configuration 

The input state used in this experiment was the pure state 

1$) = -7^{\M + \M + \h,i) + I04,i)) 

which is constructed from the first vector \<pj^i) of each basis of the MUB, on the 
analogy of the Bloch vector 4=[1, 1, 1]""" in the two dimensional case. Based on this 
analogy, this state is expected to be sufficient for the characterization of the channel 
parameters. 

As we have discussed before, each channel parameter Aj affects the length of the 
projected input state Ei[p) in the subalgebra Ai independently. The channel has 
no other effect. Thus, we can measure the effect of the channel by focusing only on 
the subalgebras, and estimate the parameters independently. For this purpose, any 
Hermitian operator belonging to the subalgebra Ai can be considered as observable, 
and can be used for the estimation of Aj. The reason for this is that the effect of the 
channel is the same on the whole subalgebra, so it is enough to measure any direction 
inside Ai to get information on Aj. Moreover, the independence of the measurements 
on different subalgebras results in a diagonal covariance matrix. 

For example, valid observables for each subalgebra can be constructed using the 
MUB in the following way: 

d 

A = ^J\4>i,j){<Pi,j\ 

i=i 
This way we get observables with d different eigenvalues. Beyond that, the specific 
value of the eigenvalues are irrelevant, as we are interested only in the outcome 
probabilities. 

As it can be seen from the above, a total of four configurations were used in the 
experiments. The resulting estimations, and characteristic quantities are plotted in 
Fig. Ei More examples can be found in [TB] . 

We can see that even in the case of a qutrit channel with a high number of 
parameters, this method can provide very accurate parameter estimation, though 
the Hilbert-Schmidt norm does not seem to approach zero over the given range 
of the measurement number n^. Results from experiments performed with higher 
measurement numbers show that the reason of this is slower convergence. 
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Figure 6: Estimation with nonoptimal configuration for the channel parameters Ai 
-0.2, A3 = -0.1 and A4 = 0.1. 
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6.3.2 Optimal Experiment Configuration 

In this setup, both the input and measurement were optimal with respect to the 
known channel directions, as discussed in section [5l The characteristic quantities are 
plotted in Figure [5l 

The results indicate, that the efficiency of the optimal experiment configuration 
highly outperforms the nonoptimal one. We can also see that in the optimal setting, 
we can reach a very accurate estimation with only about n^ = 1000 number of 
measurements in each configuration. 

Note that each of the case studies presented in this section assume a known chan- 
nel model. As we pointed out in section 13.21 knowledge about the channel model 
could be available in some cases. It can be an interesting question however, that 
if there is no such information available, then can it be beneficial to try to obtain 
it, in our case possibly by using the method presented in section H] to estimate the 
channel directions? To answer this question, a comparison would be necessary be- 
tween our method of direction estimation combined with optimal experiment design 
and a channel estimation method that uses no a priori knowledge about the channel 
structure, from the aspect of resource requirement. This study is not in the scope of 
this work, but the papers [7] and [8] suggest that in order to achieve an accuracy of 
order comparable with the results given in section [6] without making assumptions 
on the channel can require a number of measurements of order 10^-10^. This is at 
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Figure 7: Estimation with optimal configuration for Ai 
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least about the same order as the approximate measurement requirement of our two 
step procedure. 

6.4 Robustness of the optimal experiment design 

It is often the case that the optimal experimental configurations designed to 
probe a set of parameters are sensitive to the assumed parts of the model used 
to derive the optimal settings. This can be an issue mainly because the direction 
estimation algorithm presented in section H] gives only approximate results. Thus, 
the aim of this subsection is to present a small example on the performance of the 
optimal experimental configuration for the case when the actual channel directions 
are slightly perturbed from the assumed channel directions. 

We know, that the channel directions |0i,i), |02,i); 103, 1) niust satisfy ([3]), i.e. 
each |0j,i) must be an element of a basis such that the three bases form a set of 
MUB. This tells us that the found (possibly inaccurate) channel directions must be 
transformed versions of the real channel directions by some unitary transform. In 
the qubit case, expressing the channel directions with a basis of Bloch vectors Vi, 
V2, and V3, this transform can be interpreted as the rotation of each Vj around a 
given axis a with a given angle a. 

The following example is a modified version of case study 16.2.31 The parameter 
estimation was done assuming that the {vj} basis represents the channel directions. 
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Figure 8: Example results on the robustness of the optimal experiment design for a qubit Pauli 
channel with parameters Ai = 0.3, A2 = —0.1, A3 = 0.1. On Figure [5al the Hilbert-Schmidt norm 
and on Figure I8bl the estimated parameter means were depicted in function of the perturbation 
parameter a after n^ ~ 1500 measurements in each tomography configuration. 

but the real channel was simulated using a perturbed basis {v'^}, where v'^ = i?a(Q^)vj, 
the matrix Rj^oi) being a rotation matrix. The axis of rotation a was given by the 
Bloch vector 4=[1, 1, 1]""". On Figure [Hal the Hilbert-Schmidt norm and on Figure [8b] 
the estimated parameter means were depicted in function of a after n^ = 1500 mea- 
surements in each tomography configuration. The Hilbert-Schmidt norm is clearly 
periodic. This is because in this example, after rotating the {vj} basis by the angle 
a = ^, we get the basis which is nothing but the starting basis {vj} with the order 
of the basis vectors permuted. This can be also seen on Figure [8b| where we get the 
valid parameter values again after rotating by a = ^, just in a different order. Of 
course from the aspect of robustness, we are only interested in small perturbations, 
i.e. small values of a. 

From this example, it can be seen, that considering the Hilbert-Schmidt norm, 
the optimal experimental configurations are indeed sensitive to the accuracy of the 
assumed channel directions, as the norm changes linearly for small perturbations 
with the perturbation parameter a (see Figure [8a|) . However, the results on Figure [8b[ 
suggest that the estimated mean values of the parameters change only quadratically 
for small a values. This means that the optimal parameter estimation method is 
robust in this sense. 



7 Conclusions 

Convex optimization-based parameter estimation and convex maximization-based 
experiment design methods were proposed in this paper for Pauli channels and for 
their generalized versions. 

A novel method for the parameter estimation of Pauli channel model families 
were developed for the known channel direction case based on convex optimization. 
This method results in a purely convex optimization problem, thus we can obtain a 
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globally optimal estimation with relatively simple and efficient numerical algorithms 
even in the generalized Pauli channel case. 

Furthermore, an efficient iterative method of estimating the channel directions 
was also proposed for the qubit Pauli channel case. The extension of this method to 
the general higher dimensional case is a possible direction of our further work. 

An experiment design procedure based on maximizing the Fisher information of 
the output of a generalized Pauli channel is also presented here. It is shown that the 
Fisher information is a convex function both in the input and in the measurement 
parameters. Therefore the optimal input state should be pure and the measurement 
POVM should be extremal. For qubit Pauli channels this formulation leads to an op- 
timal setting that includes pure input states and projective measurements directed 
towards the channel directions. A simple way of estimating the channel parame- 
ters in the optimal configuration is also given, and the robustness of the optimal 
configuration was considered. Further work will be directed towards the analytical 
generalization of this result to the generalized Pauli channel case. 

The effect of the optimal configuration compared to other widely used ones on 
the parameter estimation performance is demonstrated using case studies. 
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